Compatibility and noncontextuality for sequential measurements 
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I. INTRODUCTION 

Since the early days of quantum mechanics (QM), it 
has been debated whether or not QM can be completed 
with additional hidden variables (HVs), which would 
eventually account for the apparent indetcrminism of the 
results of single measurements in QM, and may end into 
a more detailed deterministic description of the world 
P-d. The problem of distinguishing QM from HV the- 
ories, however, cannot be addressed unless one makes 
additional assumptions about the structure of the HV 
theories. Otherwise, for a given experiment, one can 
just take the observed probability distributions as a HV 
model HI- Moreover, there are explicit HV theories, such 
as Bohmian mechanics 0, which can reproduce all 
experiments up to date. 

In the 1960s, it was found out that HV models repro- 
ducing the predictions of QM should have some peculiar 
and highly nonclassical properties. The most famous re- 
sult in this direction is Bell's theorem [3] . Bell's theorem 
states that local HV models cannot reproduce the quan- 
tum mechanical correlations between local measurements 
on some entangled states. In principle, the theorem just 
states a conflict between two descriptions of the world: 
QM and local HV models. However, the proof of Bell's 
theorem by means of an inequality involving correlations 
between measurements on distant systems, which is sat- 
isfied by any local HV model, but is violated by some 
quantum predictions @ , allows us to take a step further 
and test whether or not the world itself can be described 
by local HV models 0-GJ]. More recently, a similar ap- 
proach has been used to test whether or not the world 
can be reproduced with some specific nonlocal HV mod- 
els 0-Ojl. 

A second seminal result on HV models reproducing 
QM is the Kochen-Specker (KS) theorem [TMl- To 
motivate it, one first needs the notion of compatible mea- 



surements: two or more measurements are compatible, if 
they can be measured jointly on the same individual sys- 
tem without disturbing each other (i.e., without altering 
their results). Compatible measurements can be made 
simultaneously or in any order, and can be repeated any 
number of times on the same individual system and al- 
ways must give the same result independently of the ini- 
tial state of the system. 

Second, one needs the notion of noncontextuality A 
context is a set of compatible measurements. A physical 
model is called noncontextual if it assigns to a measure- 
ment a result independently of which other compatible 
measurements are carried out. There are some scenar- 
ios where the assumption of noncontextuality is specially 
plausible. For instance, in the case of measurements on 
distant systems, or in the case that the measurements 
concern different degrees of freedom of the same sys- 
tem and the degrees of freedom can be accessed inde- 
pendently. 

In a nutshell, the KS theorem states that noncontex- 
tual HV models cannot reproduce QM. This impossibility 
occurs already for a single three-level system, so it is not 
related to entanglement. 

There have been several proposals to test the KS theo- 
rem (20l-[24j , but there also have been debates whether the 
KS theorem can be experimentally tested at all (251433 ]. 
Nevertheless, first experiments have been performed, but 
these experiments required some additional assumptions 
[35l - l37l . l45j . Furthermore, the notion of contextuality has 
been extended to state preparations [38[ and experimen- 
tally investigated [39j ] . 

Quite recently, several inequalities have been proposed 
which hold for all noncontextual models, but are violated 
in QM, potentially allowing for a direct test [40l-|43l|. A 
remarkable feature of some noncontextuality inequalities 
is that the violation is independent of the quantum state 
of the system [42|, HU . In this paper, we will call these 
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inequalities KS inequalities, since the proof of the KS the- 
orem in Ref . fl9l ] is also valid for any quantum state of the 
system. Very recently, several experiments have found vi- 
olations of noncontextual inequalities (i3-l48j . Three of 
these experiments have found violations of a KS inequal- 
ity for different states [3, EH or for a single (maximally 
mixed) state [48| . In these experiments, compatible ob- 
servables are measured sequentially. 

The measurements in any experiment are never per- 
fect. In tests of noncontextuality inequalities, these im- 
perfections can be interpreted as a failure of the assump- 
tion that the observables measured sequentially on the 
same system are perfectly compatible. What if this com- 
patibility is not perfect? We will refer to this problem 
as the "compatibility loophole" . The main aim of this 
paper is to give a detailed discussion of this loophole and 
demonstrate that, despite of this loophole, still classes of 
HV models which obey a generalized definition of non- 
contextuality can be experimentally ruled out. 

The paper is organized as follows: In Sec. |TT] we give 
precise definitions of compatibility and noncontextuality, 
focusing on the case of sequential measurements. We 
also review some inequalities which have been proposed 
to test noncontextual HV models. 

In Sec. IHII we discuss the case of not perfectly compati- 
ble observables. We first derive an inequality which holds 
for any HV model, however, this inequality is not experi- 
mentally testable. Then, we consider several possible ex- 
tensions of noncontextuality. By that, we mean replac- 
ing our initial assumption of noncontextuality for per- 
fectly compatible observables by a new one, which covers 
also nearly compatible observables and implies the usual 
noncontextuality if the measurements arc perfectly com- 
patible. We then present several experimentally testable 
inequalities which hold for HV models with some general- 
ized version of noncontextuality, but which are violated in 
QM. One of these inequalities has already been found to 
be violated in an experiment |44j . In Sec. II VI we present 
details of this experiment. 

In Sec.|V]we present two explicit contextual HV models 
which violate all investigated inequalities. These models, 
which do not satisfy the assumptions of extended non- 
contextuality, are useful to understand which counterin- 
tuitive properties a HV model must have to reproduce 
the quantum predictions. Other contextual HV mod- 
els for contextuality experiments have been proposed in 
Ref. [49[. Finally, in Sec. IVII we conclude and discuss 
consequences of our work for future experiments. 



II. HIDDEN VARIABLE MODELS AND 
NONCONTEXTUALITY 

A. Joint or sequential measurements 

In the scenario originally used for discussing noncon- 
textuality fl9| . a measurement device is treated as a 
single device producing outcomes for several compati- 



ble measurements (i.e., a context). When treating the 
measurement device in this manner, the whole context is 
needed to produce any output at all. In this joint mea- 
surement, one of the settings of the measurement device 
is always specifically associated with one of the outcomes, 
in the sense that another measurement device exists that 
takes only that setting as input and gives an identical out- 
come as output. This is checked by repeatedly making 
a joint measurement and the corresponding compatible 
single measurements in any possible order. This is at the 
basis of the noncontextuality argument. The argument 
goes: precisely because another context-less device exists 
that can measure the outcome of interest, there is good 
reason to assume that this outcome is independent of the 
context in the joint measurement. 

In this paper we discuss sequential individual measure- 
ments, rather than joint measurements. It might be ar- 
gued that the version of the noncontextuality assump- 
tion needed in this scenario is more restrictive on the HV 
model than the version used for joint measurements. This 
would mean that a test using a sequential setup would be 
weaker than a test using a joint measurement setup, be- 
cause it would rule out fewer HV models. However, the 
motivation for assuming non-contextuality even in the 
joint measurement setup is the existence of the individ- 
ual measurements and their compatibility and repeatabil- 
ity when combined with joint context-needing measure- 
ments. Therefore, the assumptions needed in the sequen- 
tial measurements setting are equally well-motivated as 
the assumptions needed in the joint measurement setting. 

In fact, the sequential setting is closer to the actual 
motivation of assuming noncontextuality: there exist in- 
dividual context-less measurement devices that give the 
same results as the joint measurements, and we actu- 
ally use them in experiment. Furthermore, from an ex- 
perimental point of view, a changed context in the joint 
measurement device corresponds to a physically entirely 
different setup even for the unchanged setting within the 
context, so it is difficult to maintain that the outcome for 
the unc hang ed setting is unchanged from physical princi- 
ples [H, l5Cll | Motivating physically unchanged outcomes 
is much easier in the sequential setup, since the device 
used is physically identical for the unchanged setting. 

Therefore, in this paper we consider the situation 
where sequences of measurements are made on an in- 
dividual physical system. Throughout the paper, we 
consider only dichotomic measurements with outcomes 
±1, but the results can be generalized to arbitrary mea- 
surements. The question is: under which conditions can 
the results of such measurements be explained by a HV 
model? More precisely we ask which conditions a HV 
model has to violate in order to reproduce the quantum 
predictions. 
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B. Notation 

The following notation will be used in the discussed 
HV models: A is the HV, drawn with a distribution p(X) 
from a set A. The distribution summarizes all informa- 
tion about the past, including all preparation steps and 
all measurements already performed. Causality is as- 
sumed, so the distribution is independent of any event 
in the future. It rather determines all the probabilities 
of the results of all possible future sequences of measure- 
ments. We assume that, for a fixed value of the HV, the 
outcomes of future sequences of measurements are deter- 
ministic, hence all indeterministic behavior stems from 
the probability distribution. This is similar to the in- 
vestigation of Bell inequalities, where any stochastic HV 
model can be mapped onto a deterministic one where the 
HV is not known 0, [5l[ . 

In an experiment, one first prepares a "state" via cer- 
tain preparation procedures (which may include measure- 
ments). One always regards a state preparation as a pro- 
cedure which can be repeated. At the HV level, it will 
therefore lead to an experimentally accessible probability 
distribution p oxp (A). The HV model hence enables the ex- 
perimenter to repeatedly prepare the same distribution. 
In a single instance of an experiment, one obtains a state 
determined by a single value A of the HV. The prob- 
ability for this instance is distributed according to the 
distribution p exp (A), and reflects the inability of the ex- 
perimenter to control which particular value of the HVs 
has been prepared in a single instance. 

Continuing, we denote by Ai the measurement of the 
observable (or measurement device) A at the position 
i in the sequence. For example, A^B 2 C 3 denotes the 
sequence of measuring A first, then B, and finally 
C. An outcome from a measurement, e.g., B 2 from 
the above sequence, is denoted v{B 2 \A-iB 2 C 3 ) . The 
product of three outcomes is denoted v{A-iB 2 C 3 ) = 
v(A 1 \A 1 B 2 C 3 )v(B 2 \A 1 B 2 C 3 )v(C 3 \A 1 B 2 C 3 ). Given 
a probability distribution p(X), we write probabil- 



any sequence S S Sab, the obtained values of A and B 
remain the same, 



ities piB+lA^Cs) 
probability of obtaining 
C3 = —1) when the 
sured. One can also consider 
<B 2 |AiB 2 C 3 ) = p(B 2 + |A 1 B 2 C 3 ) 



[or piB+CzlA^Ca) 
the value B 2 = 
sequence A\B 2 C 3 



for the 
1-1 (and 
is mea- 
mean values like 
- p{B^\A 1 B 2 C 3 \ 



or the mean value of the 
(AiS 2 C 3 ) =p[v(A 1 B 2 C 3 ) = +1] - 



complete sequence, 
p[v{A 1 B 2 C 3 ) = 



C. Compatibility of measurements 

In the simplest case, compatibility is a relation be- 
tween a pair of measurements, A and B. For that, let 
Sab denote the (infinite) set of all sequences, which 
use only measurements of A and B, that is, Sab = 
{A 1 ,Bi,AiA 2 ,AiB 2 ,BiA 2 ,...}. Then, we formulate: 

Definition 1. — Two observablcs A and B are compat- 
ible if the following two conditions are fulfilled: 

(i) For any instance of a state (i.e. for any A) and for 



v(A k \S) = v(A l \S) ) 
v(B m \S)=v(B n \S), 



(la) 
(lb) 



where k, I, m, n are all possible indices for which the con- 
sidered observable is measured at the positions k,l,m,n 
in the sequence S. [Equivalcntly, we could require that 
p(A^ Af\S) = 0, etc., for all preparations corresponding 
to some Poxp(A).] 

(ii) For any state preparation [i.e., for any p exp (A)], the 
mean values of A and B during the measurement of any 
two sequences Si,S 2 £ Sab are equal, 



(AklS,) = (A^), 
(B m |5i) = (B n \S 2 ). 



(2a) 
(2b) 



Clearly, conditions (i) and (ii) are necessary conditions 
for compatible observables, in the sense that two observ- 
ables which violate any of them cannot reasonably called 
compatible. 

It is important to note that the compatibility of two ob- 
servables is experimentally testable by repeatedly prepar- 
ing all possible p e xp(A). The fact that this set is infinite 
is not a specific problem here, as any measurement de- 
vice or physical law can only be tested in a finite number 
of cases. A crucial point in a HV model is that the set 
of all experimentally accessible probability distributions 
Pexp(A) might not coincide with the set of all possible dis- 
tributions p(X). We will discuss this issue in Sec. MI Dl 

It should be noted that the conditions (i) and (ii) are 
not minimal, cf. Appendix A for a discussion. In partic- 
ular, we emphasize that (ii) does not necessarily follow 
from (i), as we illustrate by the following example: Con- 
sider a HV model where, for any A, all v(Ak\S) arc +1 
when the first measurement in S is Ai, while they are — 1 
when the first measurement is B\. The values v{B m \S) 
are always +1. Then, condition (i) is fulfilled, while (ii) 
is violated, since (A) = 1 but (A 2 \BiA 2 ) = — 1. 

Let us compare our definition of compatibility to 
the notion of "equivalent measurements" introduced by 
Spckkcns in Ref. [38j . In this reference, two measure- 
ments are called equivalent if, for any state prepara- 
tion, the probability distributions of the measurement 
outcomes for both measurements are the same. This is 
similar to our condition (ii) , but disregards repeated mea- 
surements on individual systems as in (i). Interestingly, 
using this notion and POVMs, one can prove the contex- 
tuality of a quantum- mechanical two- level system 38 [. 

Finally, it should be added that the notion of compat- 
ibility is extended in a straightforward manner to three 
or more observables. For instance, if three observables 
A, B, C are investigated, one considers the set Sabc of all 
measurement sequences involving measurements of A, B, 
or C and extends the conditions (i) and (ii) in an ob- 
vious way. This is equivalent to requiring the pairwise 
compatibility of A, B, C, cf. Appendix A. 
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D. Definition of noncontextuality for sequential 
measurements 

Noncontextuality means that the value of any observ- 
able A does not depend on which other compatible ob- 
servables are measured jointly with A. For our models, 
we formulate noncontextuality as a condition on a HV 
model as follows: 

Definition 2. — Let A and B be observables in a HV 
model, where A is compatible with B. We say that the 
HV model is noncontextual if it assigns, for any A, an 
outcome of A which is independent of whether B is mea- 
sured before or after A, that is, 



v(Ax) = v(A 2 \B 1 A 2 ). 



(3) 



Hence, for these sequences we can write down v(A) as 
being independent of the sequence. If the condition is 
not fulfilled, we call the model contextual. 

It is important to note that the condition ([3]) is an 
assumption about the model and — contrary to the def- 
inition of compatibility — not experimentally testable. 
This is due to the fact that for a given instance of a state 
(corresponding to some unknown A) the experimenter has 
to decide whether to measure A or B first. 

From this definition and the time ordering, it follows 
immediately that, if A is compatible with B and A is also 
compatible with C, then for noncontextual models 

viAMiB-i) = v{A 2 \B l A 2 ) = v^AMiC*) = v(A 2 \C 1 A 2 ). 

(4) 

holds. This is the often used definition of noncontextual 
models, stating that the value of A does not depend on 
whether B or C is measured before, jointly with, or after 
it. 

This definition can directly be extended to three or 
more compatible observables. For instance, if {A,B,C} 
are compatible, then noncontextuality means that for any 
A, 

v(A 1 ) = v(A 2 \B 1 A 2 )=v(A 2 \C 1 A 2 ) 

= v(A 3 \BiB 2 A 3 ) = viAslB^As) 

= v(A 3 \C 1 B 2 A 3 ) = viAaldCaAa). (5) 

Of course, the equalities in the second and third line fol- 
low, if the first line holds for any A and the HV model 
allows to see the measurement of B\ or C\ as a prepa- 
ration step. Again, if {A, a, a} is another set of compat- 
ible observables, one can derive consequences similar to 
Eq. 0. 



E. Inequalities for noncontextual HV models 



1. CHSH-like inequality 

To derive a first inequality, consider the mean value 

(xchsh) = (AB) + (BC) + (CD) - (DA) . (6) 

If the measurements in each average are compatible [i.e., 
the pairs (A, B), (B, C), (C, D), and (£>, A) are compati- 
ble observables)] , then a noncontextual HV model has to 
assign a fixed value to each measurement, and the model 
predicts 



I (Xchsh) | < 2. 



(7) 



In QM, on a two-qubit system, one can take the observ- 
ables 



A = a x ® 1, 



B = 1< 



(o- z + o- x ) 



C = <7 Z ®1, D^t(g, ^ z ° x \ (8) 

v2 

then, the measurements in each sequence are commuting 
and hence compatible, but the state 



!</,+) = (|oo) + |n))A/2 



(9) 



leads to a value of (xchsh) = 2\/2, therefore not allowing 
any noncontextual description. The choice of the observ- 
ables in Eq. ((5|) is, however, by no means unique, if one 
transforms all of them via the same global unitary trans- 
formation, another set is obtained, and the state leading 
to the maximal violation does not need to be entangled. 
In fact, the two-qubit notation is only chosen for con- 
venience and could be replaced by a formulation with a 
single party using a four-level system. For example, if we 
take the observables 
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then, the measurements in each sequence are commuting 
and hence compatible, but the product state 



If! 



z+)|0> = (|00> + |10»A/2 



(11) 



leads to a value of (xchsh) = 2\/2, therefore not allowing 
any noncontextual description. 



Here we will discuss several previously introduced 
inequalities involving compatible measurements, which 
hold for any noncontextual HV model, but which are vi- 
olated for certain states and observables in QM. Later, 
these inequalities are extended to the case where the ob- 
servables are not perfectly compatible. 



2. The KCBS inequality 

As a second inequality, we take the pentagram inequal- 
ity introduced by Klyachko, Can, Binicioglu, and Shu- 
movsky (KCBS) [4lj . Here, one takes five dichotomic 



5 



obscrvables and considers 

(xkcbs) = (AB) + (BC) + (CD) + (DE) + (EA). (12) 

If the obscrvables in each mean value are compatible and 
noncontcxtuality is assumed, it can be seen that 

(xkcbs) > -3 (13) 

holds. However, using appropriate measurements on a 
three-level system, there are qutrit states which give a 
value of (xkcbs) = 5 — 4^5 ~ —3.94, also leading to 
contradiction with noncontextuality. 

3. An inequality from the Mermin-Peres square 

For the third inequality, we take the one introduced in 
Ref. [42j. Consider the mean value 

(xks) =(ABC) + (abc) + (a/3 7 ) + (Aaa) + (Bb(3) 

- (Ccy). (14) 

If the measurements in each expectation value are com- 
patible, then any noncontextual HV model has to assign 
fixed values to each of the nine occurring measurements. 
Then, one can see that 

(Xks) < 4. (15) 

However, on a two-qubit system, one can choose the ob- 
servables of the Mermin-Peres square [H, [H[ 

A = a z ®t, B = t®a z , C = a z <S><r z , 
a = 1 ® <j x , b = a x ® 1, c = <j x ® a x , (16) 
a = a z ®a x , /? = a x (8 a z , j = a y ®cr y . 

The observables in any row or column commute and are 
therefore compatible. Moreover, the product of the ob- 
scrvables in any row or column equals 1, apart from the 
last column, where it equals —1. Hence, for any quantum 
state, 

(Xks) - 6 (17) 

holds. The remarkable fact in this result is that it shows 
that any quantum state reveals nonclassical properties if 
the measurements are chosen appropriately. 

III. NOT PERFECTLY COMPATIBLE 
MEASUREMENTS 

In any real experiment, the measurements will not be 
perfectly compatible. Hence, the notion of noncontextu- 
ality does not directly apply. The experimental violation 
of inequalities like Q, (fT5jl . and (fT5|) proves that one 
cannot assign to the measurement devices independent 
outcomes ±1. However, a model that is not trivially in 
conflict with QM also has to explain the measurement 



results of sequences of incompatible obscrvables, such as 
e.g. the results from measuring A1C2 for the obscrvables 
of the CHSH-likc inequality. Therefore, it is not straight- 
forward to find out which arc the implications of these vi- 
olations on the structure of the possible HV models. The 
reason is that the assumption that incompatible measure- 
ments have predetermined independent outcomes is not 
physically plausible. 

To deal with this problem, we will derive extended ver- 
sions of the inequalities J7]), (fT3")) , and (fl"5"j) . which are 
valid even in the case of imperfect compatibility. We will 
first derive an inequality which is an extension of inequal- 
ity ([7]) and which holds for any HV model. This inequal- 
ity, however, contains terms which are not experimentally 
accessible. Then, we investigate how these terms can be 
connected to experimental quantities, if certain assump- 
tions about the HV model are made. We will present 
three types of testable inequalities, the first two start 
from condition (i) of Definition 1, while the third one 
uses condition (ii). 

First we consider nearly compatible obscrvables. We 
show that, if the observables fulfill the condition (i) of 
Definition 1 to some extent and if assumptions about the 
dynamics of probabilities in a HV model are made, then 
these HV models can be experimentally refuted. 

In the second approach, we consider the case that a cer- 
tain finite number of compatibility tests has been made. 
For some runs of the experiment the tests are successful 
[i.e., no error occurs when checking condition (i)], and in 
some runs errors occur. We then assume that the subset 
of HVs, where noncontcxtuality holds is at least as large 
as the subset where the compatibility tests are success- 
ful. We then show that HV models of this type can, in 
principle, be refuted experimentally. 

Finally, in the third approach, wc also consider as- 
sumptions about the possible distributions p exp (X), and 
show that if the condition (ii) of Definition 1 is nearly 
fulfilled, then again this type of HV models can experi- 
mentally be ruled out. 

We will discuss these approaches using the CHSH-like 
inequality ([7|)- At the end of the section, we will also 
explain how the inequalities (THi]) and (TT51) have to be 
modified, in order to test these different types of HV 
models. 



A. CHSH-like inequality for all HV models 

To start, consider a HV model with a probability distri- 
bution p(X) and let and (B^\Bi)] denote the 
probability of finding A + if A is measured first and B + 
if B is measured first. This probability is well defined in 
all HV models of the considered type, but it is impossible 
to measure it directly, as one has to decide whether one 
measures A or B first. Our aim is now to connect it to 
probabilities arising in sequential measurements, as this 
will allow us to find contradictions between HV models 
and QM. 
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First, note that 

p[(A+|A0 and < p[A+ ^+^2]+ 

+ p[(B+|B 1 ) and (B^-|AiB 3 )]. (18) 

This inequality is valid because if A is such that it 
contributes to p[(Af \A\) and (B± \Bi)], then either the 
value of B stays the same when measuring A\B 2 (hence 
A contributes to ^[A^ , B.^ |AiB 2 ]) or the value of B is 
flipped and A contributes to p[(B^\Bi) and (B 2 \AiB 2 )\. 
The first term p[A^~, B 2 \AiB 2 ] is directly measurable as 
a sequence, but the second term is not experimentally 
accessible. 

Let us rewrite 

(AB) = l-2p[(A+\A 1 ) and (flr|fli)] 

-2p[{Ai\A{) and {B+^% (19) 

as the mean value obtained from the probabilities 
p[{Af\A x ) and (Bf\B{)}. Then, using Eq. (JBJ), it follows 
that 

(A 1 B 2 )-2/ ip [AB] < (AB) < (A 1 B 2 )+2/ ip [ylB], (20) 

where we used / ip [AB] = p[(j9^|B x ) and (B 2 \AiB 2 )] + 
p[(Bf and (B+ |Ai-B 2 )]- This p flip [AB] can be inter- 
preted as a probability that A flips a predetermined value 
of B. 

Furthermore, using Eqs. © and (O, we obtain 

K#chsh>| < 2(1 + P m v[AB] + / ip [CB] 

+ P tiip [CD] +p^[AD]) 1 (21) 

where 

(A-chsh) := {AxB 2 ) + {dB 2 ) + (C\D 2 ) - (A 1 D 3 ). (22) 

Inequality (f2T1) holds for any HV model and is the gener- 
alization of inequality ([7|). Note that for perfectly com- 
patible observables, the flip terms in inequality (|2ip van- 
ish if the assumption of noncontextuality is made. Then, 
this results in inequality ([7]). 

B. First approach: Constraints on the disturbance 
and the dynamics of the HV 

The terms p P [AB), etc. in inequality (j2"Tj) are not ex- 
perimentally accessible. Now we will discuss how they 
can be experimentally estimated when some assumptions 
on the HV model are made. 

In order to obtain an experimentally testable version 
of inequality (|21[) . we will assume that 

p[(Bt\Br) and (B 2 \A 1 B 2 )} 

<p[(fl+|Bi) and (B+.B^BiAa-Ba)] 

= p[B+,B^\B 1 A 2 B 3 \. (23) 



This assumption is motivated by the experimental pro- 
cedure: Let us assume that one has a physical state, 
for which one surely finds B^ if Bi is measured first, 
but finds B 2 if the sequence A\B 2 is measured. Phys- 
ically, one would explain this behavior as a disturbance 
of the system due to the experimental procedures when 
measuring A 1 . The left-hand side of Eq. (|2"3")l can be 
viewed as the amount of this disturbance. The right- 
hand side quantifies the disturbance of B when the se- 
quence B\A 2 B 3 is measured. In real experiments, it can 
be expected that this disturbance is larger than when 
measuring A\B 2 , because of the additional experimental 
procedures involved. Note that in real experiments, a 
measurement of B will also disturb the value of B itself, 
as can be seen from the fact that sometimes the values 
of Bi and B 2 will not coincide, if the sequence B\B 2 is 
measured. 

It should be stressed, however, that we do 
not assume that the set of HV values giving 
[(Bt\Bi) and (B 2 \AiB 2 )\ is contained in the set giving 
{Bf , B% \BiA 2 B 3 ), the assumption only relates the sizes 
of these two sets. 

In addition, by a similar reasoning, the assump- 
tion ([2"3"|l may be relaxed to 

p[(B+\B!) and (B?\AiB 2 )] < p[B^ , B^\BiSA k -iB k ], 

(24) 

where 5* is a given finite sequence of measurements from 
Sab- Again, if the measurements are nearly compatible, 
this type of HV models can be ruled out experimentally. 

Assumption (1231) gives an measurable upper bound to 
p mp [AB}. One directly has 

|(*chsh>| <2(1 +p erT [B 1 A 2 B 3 ] +p m [B 1 C 2 B 3 ] 

+ p m [D l C 2 D 3 ] +p c "[D 1 A 2 D 3 ]) 1 (25) 

where we used 

p m [BiA a B 3 ] = P [B+, B 3 -|B 1 A 2 B3]+p[Sr, B+lB^B^ 

(26) 

denoting the total disturbance probability of B when 
measuring B\A 2 B 3 . 

The point of this inequality is that if the observable 
pairs (A, B), (C,B), (C,D), and (A, D) fulfill approx- 
imately the condition (i) in the definition of compati- 
bility, the terms p m will become small, and a violation 
of inequality (f25"j) can be observed. In Ref. 44j it was 
found that (Achsh) - 2{p c "[B 1 A 2 B 3 ] + p c "[B 1 C 2 B 3 ] + 
p orr [L>iC 2 £> 3 ] +p CII [D 1 A 2 D 3 }) = 2.23(5). Hence this ex- 
periment cannot be described by HV models which fulfill 
Eq. ([2"3"jl. see also Section IV. 



C. Second approach: Assuming noncontextuality 
for the set of HVs where the observables are 
compatible 

Let us discuss a different approach to obtain exper- 
imentally testable inequalities. For that, consider the 



7 



case that the experimenter has measured a (finite) set of 
sequences in Sab in order to test the validity of condition 
(i) in the definition of compatibility. He finds that the 
conditions are violated or fulfilled with certain probabil- 
ities. In terms of the HV model, there is a certain subset 
Aab C A of all HVs where all tests in the finite set of ex- 
perimentally performed compatibility tests succeed and 
through the observed probabilities the experimenter can 
estimate the volume of this set. 

In this situation, one can assume that, for each HV 
A € Aab (where all the measured compatibility re- 
quirements are fulfilled), the assumption of noncontex- 
tuality is also valid. More precisely, one can assume 
that v(Ai\A 1 B 2 ) = v(A 2 \B 1 A 2 ) in Eq. © holds for all 
A G Aab- One may support this assumption if one con- 
siders noncontextuality as a general property of nature, 
since this is the usual noncontextuality assumption for 
the HV model where the HVs are restricted to Aab- 

To see that this assumption leads to an experi- 
mentally testable inequality, consider the case where 
the experimenter has tested all sequences up to 

(3) 

length three, that is all sequences from S\ B — 
{AiA 2 A 3 , AiA 2 B 3 , . . . , B X B 2 B 3 } and has determined, 
for each of them, the probability p e "(S) that some 
measurement, which is performed two or three times 
in the sequence is disturbed. For sequences like 
SiA 2 B 3 , this is exactly p crl [B^B^ defined in Eq. ([25]) . 
However, now we have additional error terms like 
p^lB^As] = p[B+, B- \B,B 2 A 3 ] +p[B~ , B+^B^} 
and_ p^jB^B-i] = 1 - p[B+ , B+ B+^B^} - 
p\B l ,B 2 B 3 \BiB 2 B 3 ], etc. These probabilities are not 
completely independent: due to the time ordering, a A 
that contributes to p^^B^A^ (or p^^A^]) will 
also contribute to p^iB^Bs] (or p crr L4i A 2 A 3 }). Con- 
sequently, relations like p c "[B 1 B 2 A 3 ] < p c "[B 1 B 2 B 3 ] 
hold. 

Let us define 

P c "[Sab] = (E P m [^)-p m [BiB 2 A 3 ]-p err [yM 2 fl 3 ]. 

(27) 

Here, we have excluded two p clT in the sum, as the A's 
which contribute to them are already counted in other 
terms. With this definition, for a given distribution 
Pexp(A), a lower bound to the probability of finding a 
A where condition (i) from Definition 1 is fulfilled, is 

P [Aab}>1-P cit [Sa 3) b}- (28) 

From that and the assumption that v(Ai\AiB 2 ) = 
v(A 2 \BiA 2 ) on Aab, it directly follows that 

p^[AB]<p™(sf B ), (29) 

giving a measurable upper bound to p m P[AB}. Finally, 
the experimentally testable inequality 

|(A- C hsh)| < 2(1 + P m [S A 3} B ] +p m [S { cl} 

+ p c "[S^ D ]+p c "[S^ D }) (30) 



holds. This inequality is similar to inequality (|2"5|) . but 
it contains more error terms. Nevertheless, a violation of 
this inequality in ion-trap experiments might be feasible 
in the near future (see Sec. IIV|) . 

This result deserves two further comments. First, in 
the derivation we assumed a pointwise relation; namely, 
for all A G Aab, the noncontextuality assumption 
v(Ai\AiB 2 ) = v(A 2 \BiA 2 ) holds. Of course, we could 
relax this assumption by assuming only that the volume 
of the set where v(A 1 \AiB 2 ) = v(A 2 \B 1 A 2 ) holds is not 
smaller than the volume of Aab- Under this condition, 
Eq. 4H still holds. 

Second, when comparing the second approach with the 
first one, one finds that the first one is indeed a special 
case of the second one. In fact, from a mathematical 
point of view, the first approach is the same as the sec- 
ond one, if in the second approach only the compatibility 
test S = B\A 2 B 3 is performed. Consequently, inequal- 
ity (f25|) is weaker than ([30]) . However, note that the 
first approach came from a different physical motivation. 
Further, assuming a pointwise relation for the first ap- 
proach is very assailable, as only one compatibility test 
is made. But, as we have seen, a relation between the 
volumes suffices. A pointwise relation can only be mo- 
tivated if all experimentally feasible compatibility tests 
are performed. 

D. Third approach: Certain probability 
distributions cannot be prepared 

The physical motivation of the third approach is as fol- 
lows: The experimenter can prepare different probability 
distributions p exp (X) and check their properties. For in- 
stance, he can test to which extent the condition (ii) in 
Definition 1 is fulfilled. However, in a general HV model 
there might be probability distributions p(X) that do not 
belong to the set of experimentally accessible p eX p(A). 
One might be tempted to believe that this difference is 
negligible and that the properties that can be verified 
for the Poxp(A) hold also for some of the p{\). In this 
approach we will show that this belief can be experimen- 
tally falsified. More specifically, we show that if only 
four conditional probability distributions have the same 
properties as all p e xp(A), then a contradiction with QM 
occurs. 

So let us assume that the experimenter has checked 
that the observablcs A and B fulfill condition (ii) in Def- 
inition 1 approximately. He has found that 

\(B 1 \B 1 A 2 )-(B 2 \A 1 B 2 )\< SAB (31) 

for all possible (or, at least, a large number of) p exp (A). 
This means that, for experimentally accessible distribu- 
tions Pexp(A), one has that 

\p{Bt\BxA 2 ) -p{B+\A x B 2 )\ < sab/2, 
\piBi\BiAa) -p{B 2 \A x B 2 )\ < e A bI% (32) 
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as can be seen by direct calculation. 

Let us consider the flip probability 
p Rip [AB] p[(Bt\Bi) and {B^A^)} + 

p[(B 1 \B{) and (B 2 \AiB 2 )\ again. Here, the prob- 
ability p stems from the initial probability distribution 
p(X). One can consider the conditional probability 
distributions q ± (X) which arise from p(X) if the result 
of B\ is known. Physically, the conditional distributions 
describe the situation for an observer, who knows 
that the experimenter has prepared p(X) but has the 
additional information that measurement of Bi will give 
+1 or —1. With that, we can rewrite 

p Rip [AB] = q+iB^AxB^piB^Bi) 

+ q -{B+\A 1 B 2 )p{B^\B 1 ). (33) 

Now let us assume that these conditional probability dis- 
tributions have the same properties as all accessible dis- 
tributions Pexp(X). Then, the bounds in Eq. (|32[) also 
have to hold for q ± . Since p(B^\B{) + p(B{ \B{) = 1, 
it follows directly that p P [AB] < eab/2- Hence, under 
the assumption that some conditional probability distri- 
butions in the HV model have similar properties as the 
preparable p exp (X), the inequality 

(Achsh) < 2 + E A B + £cb + £cd + £ad (34) 

holds. A violation of it implies that, in a possible 
HV model, certain conditional probability distributions 
have to be fundamentally different from experimentally 
preparable distributions. 

Again, this result deserves some comments. First, 
note that the tested bound in Eq. ((52"]) does not have 
to hold for all probability distributions in the theory. In 
an experiment testing Eq. (|34p with some p exp (X) only 
assumptions about four conditional probability distri- 
butions (corresponding to two possible second measure- 
ments with two outcomes) have to be made. In fact, 
assuming Eq. (f3"2"j) for ^-distributions (i.e., a fixed HV A) 
is not very physical, as in this case the left-hand side of 
these equations is or f . 

Second, finding an experimental violation of Eq. (|34|) 
shows that these four distributions have properties sig- 
nificantly different from all preparable p xp(A). In other 
words, one may conclude that in a possible HV model 
describing such an experiment, it must be forbidden to 
prepare p exp (A) with additional information about the 
result of B or D. 

To make this last point more clear, consider the situ- 
ation where the experimenter has prepared p exp (X) and 
a second physicist has the additional knowledge that the 
result of B\ will be +1, if it would be measured as a first 
instance. Both physicists disagree on the probability dis- 
tribution j5 oxp and q + , but that is not the central problem 
because this occurs in any classical model as well. The 
point is that q + cannot be prepared: If the experimenter 
measures B\ and keeps only the cases where he finds +1 
he obtains a new experimentally accessible probability 
distribution p cxp . But this will not be the same as the 



probability distribution g + , because in this case, the first 
measurement has already been made. 

E. Application to the KCBS inequality and the KS 
inequality (|15p 

In the previous discussion, we used the CHSH like in- 
equality ([7]) to develop our ideas. Clearly, one could also 
start from inequalities (fT3"|) and (TT5|) to obtain testable 
inequalities for the types of HV models discussed above. 

For the KCBS inequality (fT2"j) this can be done with 
the same methods as before, since the KCBS inequality 
uses only sequences of two measurements, as the CHSH 
inequality (J7| . A generalization of Eq. (jMJ) is 

(*kcbs) := {AiB 2 ) + (dB 2 ) + (C\D 2 ) + (E 1 D 2 ) 

+ (EiA 2 ) > -3 - (e AB + £cb + £cd + £ed + £ea)- 

(35) 

Generalizations of Eqs. (f2"5|) and (f3"D"|) can also be written 
down in a similar manner. 

Also for the KS inequality (jl~5|) . one can deduce gen- 
eralizations, which exclude certain types of HV mod- 
els. The main problem here is to estimate a term like 
{AiB 2 Cz). First, an inequality corresponding to Eq. (fT8j) 
is 

PKA+IA,) and (B+IB,) and (C+|d)] 

< p[A+,B+,C+\A 1 B 2 C 3 ] + 

+ p[(B+\B 1 ) and {B^A^)) 

+ p[(C+\C 1 ) and (Cg-IAifliA)], (36) 

which holds again for any HV model. Then, a direct 
calculation gives that one has 

(ABC) < (AiB 2 C 3 ) +4/ ip [AB] + 4p mp [(AB)C] 

(ABC) > (A!B 2 C 3 ) - 4p mp [AB] - 4p mp [(AB)C], (37) 

where 

p mp [(AB)C] =p[(C+\C 1 ) and (C^BaCs)] 

+ p[(Cf |d) and (Cg-|AiB a C 3 )]. (38) 

Given these bounds, one arrives at testable inequalities, 
provided assumptions on the HV model are made as in 
the three approaches above. If Eq. (f23j) is assumed, one 
can directly estimate p a[p [AB] < p c "[A l B 2 ] and 

p mp [(AB)C] <p eiI [(AB)C] (39) 

= p[CfC7|CiAafl3Cy + P\- C i c i \CiA 2 B 3 C 4 }. 
Then, if one writes down the generalized form of Eq. (TTSl) , 
then there are more correction terms than in Eq. (|25D . 
Moreover, they involve sequences of length four. On av- 
erage, these p e " terms have to be smaller than 2/48 « 
0.0417 in order to allow a violation. Consequently, an 
experimental test is very demanding (see also the discus- 
sion in subsection IIV CJ) . Finally, generalizations in the 
sense of Eqs. (|BT)1 and |3"4"|) can also be derived in a similar 
manner. 



9 




397 nm 

(cooling/ 

detection) 



FIG. 1: Partial level scheme of 40 Ca + showing the relevant 
energy levels and the laser wavelengths needed for coupling 
the states. The D-states are metastable with a lifetime of 
about Is. A magnetic field of about 4 Gauss is applied to lift 
the degeneracy of the Zeeman states. The states |0), |1) used 
for encoding quantum information are indicated in the figure. 



IV. EXPERIMENTAL IMPLEMENTATION 

Experimental tests of noncontextual HV theories have 
been carried out with photons (35l - l37l. l45l . Hfjj , neutrons 
[37l . Hf| , laser-cooled trapped ions I44|. and liquid-state 
nuclear magnetic resonance systems [481 ] . In the exper- 
iments with photons and neutrons, single particles were 
prepared and measured in a four-dimensional state space 
composed of two two-dimensional state spaces describing 
the particle's polarization and the path it was follow- 
ing. In contrast, in a recent experiment with trapped 
ions [44[ , a composite system comprised of two trapped 
ions prepared in superpositions of two long-lived internal 
states was used for testing the KS theorem. In the follow- 
ing, we will describe this experiment and present details 
about the amount of noncompatibility of the observablcs 
implemented. 



A. Experimental methods 



cxp[— i|((T0 <g) 1 + 1 <g) cr^)] are realized by resonantly 
coupling the states |0), Finally, the single-qubit 

gate U z (9) = exp(— i|cr z ) is implemented by a strongly 
focused laser inducing a differential light-shift on the 
states of the first ion. This set of operations, S = 
{U z (6),U(6,cj)),U MS (6, </>)}, which is sufficient for con- 
structing arbitrary unitary operations, can be used for 
preparing the desired input states 

A measurement of a z by a state projection onto the ba- 
sis states |0), |1) on one of the ions is carried out by illu- 
minating the ion with laser light coupling the Sx/2 ground 
state to the short-lived excited state P\/2 and detecting 
the fluorescence emitted by the ion with a photomulti- 
plier. Population in P1/2 decays back to S1/2 within a 
few nanoseconds so that thousands of photons are scat- 
tered within a millisecond if the ion was originally in the 
state |1). If it is in state |0), it does not couple to the 
light field and therefore scatters no photons. In the ex- 
periment, we assign the state |1) to the ion if more than 
one photon is registered during a photon collection pe- 
riod of 250/xs. In this way, the observables a z <£> 1 and 
1 <£) o z can be measured. 

To measure further observables like <7j ® l , 1 ® crj , or 
<7j ® <jj, the quantum state p to be measured is trans- 
formed into U pU' by a suitable unitary transformation 
U prior to the state detection. Measuring the value of 
a z (8 1 on the transformed state is equivalent to mea- 
suring the observable A = W(a z g) t)U on the original 
state p. The measurement is completed by applying the 
inverse operation [/' after the fluorescence measurement. 
The purpose of this last step is to map the projected state 
onto an eigenstate of the observable A. In this way, any 
observable A with two pairs of degenerate eigenvalues 
can be measured. The complete measurement, consist- 
ing of unitary transformation, fluorescence detection and 
back transformation, constitutes a quantum nondemoli- 
tion measurement of A. Each measurement of a quantum 
state yields one bit of information which carries no infor- 
mation about other compatible observables. 



Trapped laser-cooled ions are advantageous for these 
kinds of measurements because of the highly efficient 
quantum state preparation and measurement procedures 
trapped ions offer. In Ref. [44[, a pair of 40 Ca + ions was 
prepared in a state space spanned by the states |00), 1 01) , 
1 10) , |00), where |1) = \Si/ 2 ,ms = 1/2) is encoded in a 
Zeeman ground state and |0) = \D 5 /2i m D = 3/2) in a 
long-lived metastable state of the ion (see Fig. [T]). 

A key clement for both preparation and measurement 
are laser-induced unitary operations that allow for ar- 
bitrary transformations on the four-dimensional state 
space. For this, the entangling operation U MS (6, <fi) = 
cxp(— if<T0 <g> cr^) where er^ = cos(</>) a x + sin(0) a y is 
realized by a bichromatic laser field off-resonantly cou- 
pling to transitions involving the ions' center-of-mass 
mode along the weakest axis of the trapping potential 
f54| . In addition, collective single-qubit gates U{9,4>) — 



B. Measurement results 

The measurement procedure outlined above is very 
flexible and can be used to consecutively measure sev- 
eral observables on a single quantum system as illus- 
trated by the following example. To test inequal- 
ity Eq. (fH| for the observables of the Mermin-Pcrcs 
square p^|) . the quantum state \tp) = |ll)/\/2 + 
e*5 (|01) + |10))/2 is prepared by a applying the sequence 
of gates U MS (~n/2,n/4)U MS {-n/2,0)U(n/2,0) to the 
initial state |11). The correlations that are found for a 
sequence of measurements A1B2C3, where A-y = a z <£> er 2 , 
B2 = Px <8> &x j and C3 = a y ® a y are shown in Fig. [21 For 
this measurement, 1100 copies of the state were created 
and measured. Each corner of the sphere corresponds 
to a measurement outcome (v\, v 2 , V3) where = ±1 
is the measurement result for the k th observable. The 
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FIG. 2: Measurement correlations for a sequence of mea- 
surements A1B2C3 with A\ = a z ® a z , B2 = o x ® a x , and 
C3 — <J y ® (jy for a partially entangled input state. The 
colors indicates whether V1V2V3 = +1 (yellow spheres) or 
V1V2V3 = —1 (red spheres). The volume of a sphere is pro- 
portional to the likelihood of finding the corresponding mea- 
surement outcome (wi,«2, ir- 
relative frequencies of the measurement outcomes are in- 
dicated by the volume of the spheres attached to the 
corners, and the colors indicates whether V1V2V3 = +1 or 
V1V2V3 = — 1. For perfect state preparation and measure- 
ments, one would expect to observe always V1V2V3 = — 1. 
Due to experimental imperfections, the experiment yields 
(v\ V2V3) = —0.84(2). Nevertheless, the experimental re- 
sults nicely illustrate the quantum measurement process: 
the first measurement gives (a z <E> c z ) = 0.00(2), i.e. the 
state \tjj} is equally likely to be projected onto |* + ) = 1 1 1) 
(vi = +1) and onto |*_) = ([01) + \W))/V2 (vi = -1). 
In the latter case, the projected state | , F_) is an eigen- 
state of a x <g> a x and o~ y o~ y so that these measurements 
give definite results v 2 = +1 and V3 = +1 (upper left 
corner of Fig. [5]). In the former case, the projected state 
is not an eigenstate of a x ® a x and «2 = +1 and V3 = — 1 
are found with equal likelihood. In this case, V2 and i>3 
are random but correlated with V2V3 = 1 (the other two 
strongly populated corners of Fig. 0). 

In Ref. [H[, also the other rows and columns of the 
Mermin-Peres square (fTT5|) were measured for the state 
\tp), and a violation of Eq. (fH)) was found with (xks) = 
5.36(4). Also different input states were investigated to 
check that the violation is indeed state-independent. The 
fact that the result falls short of the quantum mechan- 
ical prediction of (xks) — 6 is due to imperfections in 
the measurement procedure. These imperfections could 
be incorrect unitary transformations, but also errors oc- 
curring during the fluorescence measurement. 

An instructive test consists in repeatedly measuring 
the same observable on a single quantum system and an- 
alyzing the measurement correlations. Table U shows the 
results of five consecutive measurements of A = a z <E> 1 
on a maximally mixed state based on 1100 experimental 
repetitions. 

As expected, the correlations (AiAi + k\A\ . . . A$) are 



TABLE I: Measurement correlations (AiAj \Ai . . . As) be- 
tween repeated measurements of A — a z ® 1 for a maximally 
mixed state. Observing a correlation of (AiA;|^4i . . . A5) = 
onj means that the probability for the measurement results of 
Ai and Aj to coincide equals (a»j + l)/2. 



Measurement 


2 3 4 


5 


1 


0.97(1) 0.97(1) 0.96(1) 


0.95(1) 


2 


0.97(1) 0.97(1) 


0.96(1) 


3 


0.98(1) 


0.98(1) 


4 




0.98(1) 



TABLE II: Measurement correlations (AiAj \Ax . . . A 5 ) be- 
tween repeated measurements of A = a x $5 <j x for a maximally 
mixed state. Observing a correlation of (AiAj \A\ . . . As) = 
otij means that the probability for the measurement results of 
Ai and Aj to coincide equals (onj + l)/2. 



Measurement 


2 3 4 5 


1 


0.94(1) 0.88(1) 0.82(2) 0.80(2) 


2 


0.93(1) 0.87(2) 0.84(2) 


3 


0.90(1) 0.87(2) 


4 


0.93(1) 



independent of the measurement number i within the 
error bars. However, the correlations become smaller and 
smaller the bigger k gets. Table HT1 shows another set of 
measurements correlations (j4jAj|vli . . . A 5 ), where A = 
a x ®<j x . Here, the correlations are slightly smaller, since 
entangling interactions are needed for mapping A onto 
<7 z £g)]l, which is experimentally the most demanding step. 

It is also interesting to compare the correlations 
(AiA 3 \AiA 2 A3) with the correlations (AiA 3 | AiB 2 A 3 ) 
for an observable B that is compatible with A. 
For A = a x (£> a x and B = a z <E> er 2 , we find 
(A 1 A3\A 1 A 2 A 3 ) = 0.88(1) and (A 1 A3\A 1 B 2 A 3 ) = 
0.83(2) when measuring a maximally mixed state; i.e., 
it seems that the intermediate measurement of B per- 
turbs the correlations slightly more than an interme- 
diate measurement of A. Similar results are found 
for a singlet state, where (AiA 3 \AiA 2 A 3 ) = 0.92(1), 
(B^B^Bs) = 0.91(1), but (AiA 3 \AiB 2 A 3 ) = 
0.90(1), and (B^^B^B^) = 0.89(1). Because of 
(B 1 B 3 \B 1 A 2 B 3 ) = l-2p Q " correlations of the 

type (BiB 3 \BiA2B 3 ) are required for checking inequality 
([25j) that takes into account disturbed HVs. 



C. Experimental limitations 

There are a number of error sources contributing to 
imperfect state correlations, the most important being: 

(i) Wrong state assignment based on fluorescence data. 
During the 250 /is detection period of the current exper- 
iment, the number of detected photons has a Poissonian 
distribution with an average number of =8 photons 
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if the ion is in state |1). If the ion is in state |0), it does 
not scatter any light, however, light scattered from trap 
electrodes gives rise to a Poissonian distribution with an 
average number of = 0.08 photons. These photon 
count distributions slightly overlap. The probability of 
detecting or 1 photons even though the ion is in the 
bright state, is 0.3%. The probability of detecting more 
than one photon if the ion is in the dark state is also 0.3%. 
Therefore, if the threshold for discriminating between the 
dark and the bright state is set between 1 and 2, the prob- 
ability for wrongly assigning the quantum state is 0.3%. 
Making the detection period longer would reduce this er- 
ror but increase errors related to decoherence of the other 
ion's quantum state that is not measured. 

(ii) Imperfect optical pumping. During fluorescence 
detection, the ion leaves the computational subspace 
{|0), |1)} if it was in state |1) and can also populate the 
state 1 5*1/2, m s = — 1/2}- To prevent this leakage, the ion 
is briefly pumped on the S1/2 O -Pi/ 2 transition with cr + - 
circularly polarized light to pump the population back 
to |1). Due to imperfectly set polarization and misalign- 
ment of the pumping beam with the quantization axis, 
this pumping step fails with a probability of about 0.5%. 

(hi) Interactions with the environment. Due to the 
non-zero differential Zeeman shift of the states used for 
storing quantum information, superposition states de- 
phase in the presence of slowly fluctuating magnetic 
fields. In particular, while measuring one ion by fluo- 
rescence detection, quantum information stored in the 
other ion dephases. We partially compensate for this ef- 
fect by spin-echo-like techniques (5f| that are based on a 
transient storage of superposition states in a pair of states 
having an opposite differential Zeeman shift as compared 
to the states |0) and |1). A second interaction to be taken 
into account is spontaneous decay of the metastable state 
|0) which however only contributes an error of smaller 
than 0.1%. 

(iv) Imperfect unitary operations. The mapping op- 
erations arc not error-free. This concerns in particular 
the entangling gate operations needed for mapping the 
cigenstatc subspace of a spin correlation o~i ® o~j onto the 
corresponding subspaces of o~ z ® 1. For this purpose, a 
M0lmer-S0rensen gate operation U MS (ir/2, 4>) 0, HH is 
used. This gate operation has the crucial property of re- 
quiring the ions only to be cooled into the Lamb-Dickc 
regime. In the experiments, the centcr-of-mass mode 
used for mediating the gate interaction is in a thermal 
state with an average of 18 vibrational quanta. In this 
regime, the gate operation is capable of mapping |11) 
onto a state |00) + e 4 *|ll) with a fidelity of about 98%. 
Taking this fidelity as being indicative of the gate fidelity, 
one might expect errors of about 4% in each measurement 
of spin correlations ai®o~j as the gate is carried out twice, 
once before and once after the fluorescence measurement. 

These error sources prevented us from testing a gen- 
eralization of inequality ()15|) as discussed in subsection 
IIIIEI Measurement of the correlations (BiB 3 \BiA 2 B 3 ) 
and (C1C4IC1A2-B3C4) resulted in error terms p eTT that 



were about 0.06 for sequences involving three measure- 
ments and about 0.1 for sequences with four measure- 
ments, i. e. twice as big as required for observing a viola- 
tion of (|15p . However, the experimental errors were small 
enough to demonstrate a violation of the CHSH-like in- 
equality (J2SJ) j valid for nonperfectly compatible observ- 
ables [4J]. A test of the inequality ([3T))) would become 
possible if the error rates could be further reduced. 



V. CONTEXTUAL HV MODELS 

In this section we will introduce two HV models which 
are contextual in the sense of Eq. © and violate the in- 
equalities discussed in Sec. HH We first discuss a simple 
model which violates inequality (|25p . and then a more 
complex one, which reproduces all measurement results 
for a (finite-dimensional) quantum mechanical system. 
These models are useful to point out which counterintu- 
itive properties a HV model must have to reproduce the 
quantum predictions, and which further experiments can 
rule out even these models. 



A. A simple HV model leading to a violation of 
inequality (1251) 

We will show here that violation of inequality (f25j) can 
be achieved simply by allowing the HV model to remem- 
ber what measurements have been performed and what 
the outcome was. The basic idea of the model is very 
simple (cf. the more complicated presentation in [!§]). 

The task is to construct a simple HV model for our 
four dichotomic observables A, B, C, and D. The HV 
A is taken to be a quadruple with entries taken from 
the set {+,—,©,0}, the latter two cases will be called 
"locked" in what follows, signifying that the value is un- 
changed whenever a compatible measurement is made. 
For convenience, we can write A = (A + , B + , C + , D + ) or 
A = (A + , B~ , C®, D e ), etc, and we take the initial distri- 
bution to be probability 1/2 of either (A + , B+, C+, D+) 
or (A~ , B~ , C~, D~). The measurement of an observable 
is simply reporting the appropriate sign, and locking the 
value in the position. To make the model contextual, we 
add the following mechanism: 

(a) If A is measured, then the sign of D is reversed and 

locked unless it is locked. 

(b) If D is measured, then the sign of A is reversed and 

locked unless it is locked. 

For the case A = (A + , B + ,C + , D + ), the measurement 
results when measuring inequality (|25p will be as follows. 

(i) Measurement of A\ will yield A^ and A = 
(A®, B + , C + , D e ), and for the next measurement 
one obtains B 2 or D 2 . 
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(ii) Measurement of B\ will yield _B + and A = 
(A + ,B®,C+,D+), and further one obtains A+B+ 



or C+B£ . 



(iii) Measurement of C\ will yield and A = 

(A+,B+,C®,D+), and we'll obtain B+ or D+ af- 
terwards. 



(iv) 



Measurement of D\ will yield D\ and A = 
(A G ,B + ,C + ,D e ), and we'll obtain C^D+ or 
A 2 . The last is because a measurement of 
A 2 will not change D® since it is locked. In 
this case, after a measurement of A2 the HVs are 
A= (A e ,B+,C+,D®). 



The case A = (A ,B ,C ,D ) is the same with re- 
versed signs. This means that 

(AiB 2 ) = (C\B 2 ) = (C X D 2 ) = -(A 1 D 2 ) = 1, (40) 



and 



p c "[BiA 2 B 3 



^[BrCiBz] 
p^iD^Ds] = 0. 



(41) 



Hence, this model leads to the maximal violation of 
Eq. <J2S1). 

In this model, the obscrvables A and D are compatible 
in the sense of Definition 1, but they maximally violate 
the noncontextuality condition in Eq. ([3]). It is easy to 
verify that p fllp [^4D] = 1, so that the assumption (j2"3")) 
does not hold. We argue that in this model, the change 
in the outcome D cannot be explained as merely due 
to a disturbance of the system from the experimental 
procedures when measuring A\. It should therefore be no 
surprise that the inequality ([25]) is violated by the model. 
Finally, note that a model behavior like this would create 
problems in any argument to establish noncontextuality 
via repeatability of compatible measurements, even for 
joint measurements as discussed in Section HIK. and not 
only in the sequential setting used here. 



In addition, the measurement apparatus will modify the 
quantum state according to 



tr(n^)' 



(42) 



depending on the measurement result ±1. 

This behavior can exactly be mimicked by a HV model, 
if we allow the value of the HV to be modified by the 
action of the measurement. If % is the Hilbert space of 
the quantum system, we use two types of HVs. First, we 
use parameters 0<A A <1,0<A B <1, etc. for each 
observable A, B, etc. and second we use a normalized 
vector \ip) e %. 

Then, for given values of all these parameters, we asso- 
ciate to any observable the measurement result as follows: 
We define q A = (-0 |II^ | V J ) and let the model predict the 
measurement result: — 1 if X A < q A , and +1 if q A < X A . 
Furthermore, depending on the measurement result, the 
values of the HVs X A and \ip) change according to 



A 1— » 



A/* 
q A 
X A 



and 



1 



n-: 



if A < q 1 



if A > q' 



if A < q* 



if A > 



(43) 



(44) 



Let us now fix the initial probability distribu- 
tion of the HVs. The experimentally accessible 
probability distributions p(X A ,X B 7 ... ;ip) shall not 
depend on the parameters X A , X B , . . . , that is, 
p(X A , X B ,. . . ; if)) = p(X' A , A' B , . . . ; -0). Hence we write 
p(<P) = J dX A dX B ■ --p(X A , ... ; V)- The probability dis- 
tribution p(V>) and the measure dip are chosen such that 



B. A HV model explaining all quantum mechanical 
predictions 

Let us now introduce a detailed HV model which re- 
produces all the quantum predictions for sequences of 
measurements. In a nutshell, this contextual HV model 
is a translation of a machine that classically simulates a 
quantum system. 

We consider the case that only dichotomic measure- 
ments arc performed on the quantum mechanical sys- 
tem. Therefore, any observable A decomposes into A = 
I1 A — H A with orthogonal projectors n + and n_. For 
a mixed state g, a measurement of this observable pro- 
duces the result +1 with probability p(A + ) = tr(n^p), 
and the result —1 with probability p(A~) = tr(n^p). 



Q P = dip p(ip)\ip)(ip\ 



(45) 



is the corresponding quantum state. 

We now verify that this model indeed reproduces the 
quantum predictions. If the initial distribution is p, then 
the probability to obtain the result —1 for A is given by 



A <g 



dX A dip p(ip) = dip (tp\U A \iP) p{ip) 



(46) 



and hence is in agreement with the quantum prediction. 
Due to the transformations in Eq. (|43|) and Eq. (|44|) . 
the probability distribution changes by the action of the 
measurement, p i— > p' . The new distribution p' again 
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does not depend on X A and, in case of the measurement 
result — 1, we have 



where S denotes Dirac's ^-distribution and q' = 
(tp' [n. A \ip') . The new corresponding mixed state is given 
by 



Qp> 



n A e . p u A 



(48) 



tr(e P n^) " 

This demonstrates that the transformation in Eq. (|42|) is 
suitably reproduced by g p n- g p >. An analogous calcula- 
tion can be performed for the measurement result +1. 

Let us illustrate that this model is actually contextual, 
as defined in Eq. ([3J. As an example, we choose two 
commuting observables A = H A — H A and B = 11^ — 
with the property that, for some pure state \ifi), we 
have (ip\AiB2\tp) = +1, while (ip\B\ip) < 1. An example 
would be A = a z ® 1 and B = — 1 <X> a z with \ip) being 
the singlet state. Then, after a measurement of Ai, the 
result of a subsequent measurement of B2 is fixed and 
hence independent of X B . However, if B is measured 
without a preceding measurement of A, then the result 
of Swill be -1 if X B < (V>|II^|V>), and +1 else. Hence, in 
our particular model, given the preparation of \ip), v{B\) 
depends on X B , while v(B2\AiB 2 ) only depends on X A . 
However, the model does not allow special correlations 
between X A and X B and hence the model is contextual, 
i.e., necessarily there are experimentally accessible values 
of the HVs, such that Eq. Q is violated. 



VI. CONCLUSIONS 



These experiments must satisfy some requirements 
which are not explicitly needed for tests of Bell inequal- 
ities. An important requirement is that one has to test 
experimentally, to which extent the implemented mea- 
surements are indeed compatible. In this paper, we have 
discussed how to deal with the inevitable errors, prevent- 
ing us from implementing perfectly compatible measure- 
ments. The problem of not-pcrfectly compatible observ- 
ables is not fatal, but should be taken into account with 
care. 

We have presented three approaches how additional re- 
quirements can be used to exclude the possibility of non- 
contextual explanations of the experimental results, and 
we have applied them to three specific inequalities of par- 
ticular interest: a CHSH-likc noncontcxtuality inequal- 
ity using sequential measurements on individual systems, 
which can be violated by specific states of four or more 
levels, a KCBS noncontextuality inequality using sequen- 
tial measurements on individual systems, which can be 
violated by specific states of three or more levels, and 
a KS inequality coming from the Mermin-Peres square 
which is violated by any state of a four-level system. 
Similar methods can be applied to any noncontextuality 
inequality, irrespective of the number of sequential mea- 
surements or the dimensionality of the Hilbert space. 

The main motivation was to provide experimentalists 
with inequalities to rule out noncontextual HV models 
unambiguously, if some additional assumptions are made. 
We have shown that a recent experiment with trapped 
ions already ruled out some of these HV models. By pro- 
viding examples of HV models, we have seen that these 
extra assumptions are not necessarily satisfied by very 
artificial HV models. Nevertheless they lead to natural 
extensions of the assumption of noncontcxtuality, and 
allow us to reach conclusions about HV models in realis- 
tic experiments with nonperfect devices. An interesting 
line of future research will be to investigate how these ex- 
tra assumptions can be replaced by fundamental physical 
principles such as locality in experiments where the sys- 
tem under observation is entangled with a distant system 
on which additional measurements can be performed. 



Experimental quantum contextuality is a potential 
source of new applications in quantum information pro- 
cessing, and a chance to expand our knowledge on 
the reasons why quantum resources outperform classical 
ones. In some sense, experimental quantum contextual- 
ity is an old discipline, since most Bell experiments are 
just experiments ruling out noncontextual HV models, 
since they do not fulfill the required spacclikc separa- 
tion needed to invoke locality as a physical motivation 
behind the assumption of noncontextuality. The possi- 
bility of observing state-independent quantum contextu- 
ality, however, is a recent development. It shows that the 
power of QM is not necessarily in some particular states, 
but also in some sets of measurements which can reveal 
nonclassical behavior of any quantum state. 
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Appendix 

In Sec. Ill CI we discussed the notion of compatibility 
for subsequent measurements. In this Appendix we pro- 
vide two examples which demonstrate that both parts 
of Definition 1 arc independent. We then show that the 
statement of compatibility can be simplified to involve 
sequences of length 2 only. 

a. Mutual independence of Definition 1 (i) and Defi- 
nition 1 (ii). For an example that (i) does not include 
(ii), assume that the expectation value of A depends on 
whether the first measurement in the sequence is A or 
B. Then (A 1 \A 1 B 2 ) ^ (A 2 \B 1 A 2 ) and hence condition 
(ii) is violated. However, such a model is not in conflict 
with condition (i) , if once A was measured, the value of 
A stays unchanged for the rest of the sequence. 

For the converse, assume a HV model where the ex- 
pected value (A) does not depend on the results of any 
previous measurement. Then, for any sequence and any 
k, (A) = (Ak\S) and, hence, condition (ii) is satisfied. 
However, p(A^A 2 \A 2 A 2 ) > 0, unless (AiA 2 ) = 1, and 
thus condition (i) is violated. 

b. Compatibility for sequences of length 2. Assume 
that, for any preparation procedure, A and B obey 

(A,) = (A t \AiA 2 ) = {A^BxAt), (49) 

i.e., condition (ii) of Definition 1 is satisfied for sequences 
of length 2. Then, for a sequence S of length k we have 



either S = S'B or S = S'A, where 5" is a sequence of 
length k — 1. In a measurement of 5, we can consider S' 
to be part of the preparation procedure and then apply 
Eq. (gSJl. It follows that (A k+1 \SA) = (A k \S'A) and 
eventually (Ak+i\SA) = (Ai) by induction. 

In a similar fashion we reduce condition (i) of Defini- 
tion 1 for dichotomic observables. For an experimentally 
accessible probability distribution p oxp (A), we denote by 
Pexp(A) the distribution obtained by a measurement of A 
and a postsclcction of the result +1. Then, for a sequence 
S of length fc, 

p{A+A- k+2 \ASA)=p{AZ +1 \SA)p(A+\A 1 ) 
= p(A^\A)p(A+\A) 
= P (A+A 2 \A 1 A 2 ), (50) 

where for the second equality we used that (Ak + i\SA) = 
(Ai) holds for p. It follows that a set of dichotomic ob- 
servables 3 is compatible if and only if, for any prepara- 
tion and any A, B e 3, (AiA 2 ) = 1 and Eq. (g§J) holds. 
In particular, this proves the assertion that pairwise com- 
patibility of three or more observables is equivalent to an 
extended definition of compatibility involving sequences 
of all compatible observables. 
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